# Multimodal Live Streaming
Minicpm V 2 6
MiniCPM-V is a mobile GPT-4V-level multimodal large language model that supports single-image, multi-image, and video understanding, equipped with visual and optical character recognition capabilities.
Image-to-Text
Transformers Other

M
openbmb
91.52k
969
Minicpm V 2 6 Int4
MiniCPM-V 2.6 is a multimodal vision-language model supporting image-to-text conversion with multilingual processing capabilities.
Image-to-Text
Transformers Other

M
openbmb
122.58k
79
Featured Recommended AI Models